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1. Summary 


1.1. Five different geographies (National Parks, Community Health Partnerships, 
Westminster Parliamentary Constituencies, Scottish Parliamentary 
Constituencies and Multi-Member Wards) were constructed from three types 
of small area geography - Postcode Areas, Census Output Areas and Data 
Zones, using the 2001 boundaries for each. 


1.2.Using the 2001 Census populations of the small area geographies, we 
estimated populations for the 5 large geographies. For each geography, this 
resulted in three different population estimates — one based on Postcodes, 
one based on Output Areas and one based on Data Zones. 


1.3. The populations based on Postcode estimates were assumed to be the most 
accurate, as Postcodes are the smallest area for which populations are 
available and so fit most precisely into larger geographies. The accuracy of 
Output Area and Data Zone based population estimates was then compared 
against Postcode Based estimates. 


1.4. Output Area based population estimates are very close to Postcode based 
population estimates for all geographies. 


1.5.Data Zones produce very close estimates for Community Health 
Partnerships, Westminster Parliamentary Constituencies and Scottish 
Parliamentary Constituencies. They can also be used to produce close 
estimates for National Parks and MMWs - with an average (absolute) 
difference of 396 between Data Zone populations and Postcode populations 
for Multi-member Wards. 
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2. 


Introduction 


2.1 


2.2. 


. The purpose of this report is to detail the work carried out in August 2007, on 


analysing how the populations of five different non-standard geographies 
varied if they were constructed out of Postcodes, Output Areas or Data 
Zones. There is an increasing demand for non-standard geography 
populations and this work was carried out in order to establish whether they 
could be built up from Data Zones or Output Areas (as Data Zones 
populations are updated every year, it would be very useful if they could be 
used to construct different geographies). 


This work was carried out in conjunction with Scottish Neighbourhood 
Statistics (SNS), Scottish Executive Geographic Information Service (SEGIS) 
and the Geography and Population & Migration Statistics branches of the 
General Register Office for Scotland (GROS). 


2.3.The purpose of the work was to assess how well three small area 


geographies — Postcodes, Output Areas and Data Zones could be matched 
to 5 different non-standard geographies, through seeing how the population 
estimate changed depending on which small area was used. 


2.4. Population estimates were based on 2001 Census population estimates for 


Postcodes, Output Areas & Data Zones. 


2.5. Non-standard geographies that were not based on Postcodes, Output Areas 


or Data Zones were chosen — and each of the three ‘small area’ geographies 
was then mapped to each of the non-standard geographies. The non- 
standard geographies investigated were: 


e National Parks — There are 2 National Parks in Scotland — The Cairngorms 
National Park (created in 2003) and The Loch Lomond and The Trossachs 
National Park (created in 2002). Their boundaries were set by wide-spread 
public consultation with interested groups. 


e Community Health Partnerships (CHPs) — There are 40 Community Health 
Partnerships in Scotland, for the most part, these have been based on the 
32 Local Authorities however 3 Local Authorities (Fife, Glasgow and 
Highland) were split into multiple CHPs. 


e Westminster Parliamentary Constituencies (WPCs) — Since 2005 there 
have been 59 Westminster Parliamentary Constituencies. Their 
boundaries are set by the Boundary Commission for Scotland and are 
based on Electoral Wards. 


e Scottish Parliamentary Constituencies (SPCs) — There are 73 Scottish 
Parliamentary Constituencies — they were set in 1990 by the Boundary 
Commission for Scotland. 


e Multi-Member Wards (MMWs) — There are 353 Multi-Member Wards that 
were created in 2007 by the Local Government Boundary Commission as 
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a replacement for the Electoral Wards. Their boundaries are loosely based 
on the old Electoral Ward boundaries. 


2.6. Three small area units were used to construct the non-standard geographies 


2.1 


2.8. 


2.9 


listed above, all using their 2001 boundaries (as Postcode and Output Area 
populations are only available in the 2001 Census). 


e Postcodes - In 2001, there were 139,045 Postcodes — these are the 
smallest geographical area for which population data is available (they 
only have the numbers, no age / sex breakdown). These were assumed to 
provide the best estimate for the non-standard geographies. 


e Output Areas — Output Areas are the standard geographies for reporting 
the 2001 Census results and contain an age / sex breakdown of the 
population. There are 42,604 Output Areas, constructed from Postcodes. 


e Data Zones - There are 6,505 Data Zones, Data Zones are the standard 
Small Area geography used by the Scottish Government and were 
constructed from Output Areas in 2001 — again, these have an age / sex 
breakdown of the population. 


.Although each 2001 Postcode fitted perfectly into a 2001 Output Area, which 


in turn fitted perfectly into 2001 Data Zones, this had no bearing on how each 
was assigned to non-standard geographies, as the population weighted 
centroid of each small area unit was used to assign them to non-standard 
geographies, without reference to the small area hierarchy of Postcodes 
within Output Areas within Data Zones. 


Population weighted centroids are a standard GIS procedure for assigning 
the population of a small geography to a large geography if the small 
geography does not wholly fit within the boundaries of the large geography — 
or lies across the border of two large geographies. The population weighted 
centroid is essentially the point in the area where population density is the 
same all around the point. For many cases, this will be somewhere with the 
highest population density within the area. The small geography is assigned 
to whichever larger geography the population weighted centroid lies in. 


.As Postcodes are the smallest geographic area, in practice, the Postcode 


construction of non-standard geographies was the closest to their actual 
boundaries. Therefore, the Postcode based population estimates were 
considered closest to the actual populations of the non-standard geographies 
(however, most indicators, including age and sex breakdowns are only 
available at Output Area and / or Data Zone level). 
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3. Method 


3.1. 


3.2. 


3.3. 


3.4. 


3:5; 


3.6. 


3.7. 


For each non-standard geography, three look-up tables (showing how each 
type of small area unit could be used to construct the larger non-standard 
geographies) were provided by the Geography branch of GROS. Following 
SNS policy, the population weighted centroid of each small area unit 
(Postcodes, Output Areas & Data Zones) was used to assign it to the non- 
standard geographies. 


Each Postcode, Output Area and Data Zone could only be assigned to one 
area in each non-standard geography set. Each of the three types of small 
area unit was assigned to the non-standard geographies independently of 
how other types of small area units were assigned. 


For example, Postcode ‘X’ could be contained within Data Zone ‘G’ and 
assigned to Multi-Member Ward ‘A’. However, this assignation would be 
based entirely on where the population weighted centroid of Postcode 'X' fell, 
regardless of whether Data Zone ‘G’ was assigned to Multi-Member Ward ‘A’ 
or not. 


Using the 2001 populations of each Postcode, Output Area and Data Zone 
(from the 2001 Census Household Populations) and the look-up of each 
small area unit to each non-standard geography, populations for each 
construction of each non-standard geography were produced. For Data 
Zones and Output Areas, age and sex breakdowns of the population were 
also available. Postcodes are too small to provide this level of detail at, so 
just the total population of each Postcode is available. 


For example, for each Community Health Partnership, the populations for the 
geography were constructed from Postcodes, Output Areas and Data Zones. 
Therefore, for each non-standard geography, three different population 
estimates were produced. 


Once the population estimates were completed for each non-standard 
geography, they were compared against each other, using the estimates 
based on Postcodes — assumed to be the closest estimate to the actual 
population. 


The results for each non-standard geography are listed fully in separate 
Excel workbooks (as reproducing the entire set would take up a large amount 
of space). The summarised results are presented below. 





6 of 19 





4. Results - National Parks: 


4.1.The table below shows compares different population estimates for National 
Parks. 


Table 1 - National Park Population Estimates 


National Postcode Output Area Difference - OA I Data Zone based Difference - DZ 
Park based based Postcode (96) population | Postcode (96) 
population | population 


The 
Cairngorms 
National Park 
The Loch 
Lomond and 
The 
Trossachs 
National Park 





4.2. Overall, the differences in estimated populations for National Parks are 
relatively small (reflecting their relatively remote locations). 
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5. Results - Community Health Partnerships: 


5.1. 


5.2. 


5.3. 


1) 
a 
I 
o 
- 
o 
o 
z 





CHPs are generally based on local authority boundaries which are a standard 
geography (which match Data Zone boundaries and, by extension, Output 
Area and Postcode boundaries). For 26 of the 40 CHPs, the populations are 
the same, whether they are constructed from Postcode, Output Areas or Data 
Zones. The other 14 arise from CHPs with have been constructed from ‘split’ 
Local Authorities — or where the CHP boundary does not exactly match the 
local authority boundary. 


Table 2 below shows the CHPs where the population estimates do not 
match. In the case where a local authority has been divided up, aggregating 
the CHPs back to local authority level resolves the differences in population 
estimates (for example, the three Highland CHPs in the table above). In 
cases where the problem is a slight mismatch between local authority 
borders and CHP borders, adding the two local authorities together (and so 
removing the border) resolves the error (for example, Aberdeen City and 
Aberdeenshire CHPs above). 


Chart 1 below details the frequency of proportional differences between Data 
Zone based estimates vs. Postcode based estimates and Output Area based 
estimates vs. Postcode based estimates. 


Chart 1) Frequency of Proportional Differences of CHP Populations (Compared to 
Postcode Populations) 
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Table 2 - CHP Population Estimates 





CHP 


Aberdeen City 
CHP 
Aberdeenshire 
CHP 

East Glasgow 
CHCP 

North Glasgow 
CHCP 

West Glasgow 
CHCP 

South East 
Glasgow 
CHCP 

South West 
Glasgow 
CHCP 

West 
Dunbartonshire 
CHP 

Argyll & Bute 
CHP 

Mid Highland 
CHP 

North Highland 
CHP 

South East 
Highland CHP 
West Lothian 
CHCP 
Edinburgh 
North CHP 


Postcode 

based 

population 
212,098 
226,898 
127,321 


100,538 


133,478 


101,939 


114,593 


93,395 
91,289 
86,653 
38,669 
83,592 


158,720 


448,618 


Output 
Area 
based 
population 
212,125 
226,871 
127,323 
100,521 


133,493 


101,863 


114,669 


93,378 
91,306 
86,653 
38,669 
83,592 
158,714 


448,624 


Difference 
- OAI 
Postcode 
(%) 





Data Zone | Difference 


based - DZ/ 
population | Postcode 
(%) 


212,125 
226,871 
127,556 

99,821 


133,960 


101,176 


115,356 


93,378 
91,306 
87,215 
38,462 
83,237 
158,714 


448,624 


Average population of a Community Health Partnership area = 126,550 
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6. Results - Westminster Parliamentary Constituencies: 


6.1.As the WPC boundaries were drawn up by the Boundary Commission for 
Scotland, they have no relation to any standard geography, so their borders 
can cut across Postcode, Output Areas and Data Zones. 


6.2.When population estimates based on Output Areas are compared to 
population estimates based on Postcodes, 9 are the same, 24 WPC have 
higher Output Area based population estimates and 26 have higher Postcode 
based population estimates. 


6.3.When comparing Output Area derived estimates to Postcode derived 
estimates, the largest positive difference comes to 425 extra people in the 
Paisley & Renfrewshire North WPC (+0.49%) and the largest negative 
difference comes to 425 fewer people in Paisley & Renfrewshire South WPC 
(-0.50%). On average, the differences between Output Area and Postcode 
derived populations are small — around 0.12 per cent per WPC. 


6.4.When population estimates based on Data Zones are compared to 
population estimates based on Postcodes, 8 are the same, 25 WPC have 
higher Data Zone based population estimates and 26 have higher Postcode 
based population estimates. 


6.5.When comparing Data Zone derived estimates to Postcode derived 
estimates, the largest positive difference comes to 3,004 extra people in the 
Glasgow North West WPC (+3.7%) and the largest negative difference 
comes to 2,443 fewer people in Glasgow North WPC (-3.5%). 


6.6.On average, the differences between Data Zone and Postcode derived 
populations are around minus 0.53 per cent per WPC. Assuming a cut-off of 
plus or minus 2.5 per cent for Data Zone vs. Postcode populations, only 2 
WPCs fall outside the cut-off points. 


6.7.Charts 2a below details the frequency of proportional differences between 
Data Zone based estimates vs. Postcode based estimates and Chart 2b 
details Output Area based estimates vs. Postcode based estimates. 


6.8. The quartile absolute differences and quartile percentage differences of the 
estimates based on Data Zones or Output Areas against estimates based on 
Postcodes for WPCs are shown in Table 3 below. 
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Table 3 - WPC Population Estimates 


Output Area vs. Data Zone vs. | Output Area vs. Data Zone vs. 
Postcode (Abs. Postcode (Abs. Postcode (96 Postcode (96 


Q2 (Median) 


Q3 





Average population of a Westminster Parliamentary Constituency area = 85,980 
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7. Results - Scottish Parliamentary Constituencies: 


7.1. Similarly to WPCs, SPC boundaries were drawn up by the Boundary 
Commission for Scotland and, as a result, there are several differences in the 
population estimates, depending on which small area unit was used to 
estimate them. 


7.2.When population estimates based on Output Areas are compared to 
population estimates based on Postcodes, 6 are the same, 34 SPCs have 
higher Output Area based population estimates and 33 have higher Postcode 
based population estimates. 


7.3.When comparing Output Area derived estimates to Postcode derived 
estimates, the largest positive difference comes to 457 extra people in the 
Glasgow Kelvin SPC (+0.7%) and the largest negative difference comes to 
458 fewer people in the Glasgow Anniesland SPC (-0.7%). On average, the 
differences between Output Area and Postcode derived populations are small 
— around 0.13 per cent per SPC. 


7.4.When population estimates based on Data Zones are compared to 
population estimates based on Post Codes, 5 are the same, 35 SPCs have 
higher Data Zone based population estimates and 33 have higher Postcode 
based population estimates. 


7.5.When comparing Data Zone derived estimates to Postcode derived 
estimates, the largest positive difference comes to 1,036 extra people in the 
Dumfries SPC (+1.3%) and the largest negative difference comes to 1,036 
fewer people in the Galloway & Upper Nithsdale SPC (-1.6%). 


7.6.On average, the differences between Data Zone and Postcode derived 
populations are around 0.51 per cent per SPC. Assuming a cut-off of plus or 
minus 2.5 per cent for Data Zone vs. Postcode populations, no SPCs fall out 
of the cut-off points. 


7.7.Charts 3a below details the frequency of proportional differences between 
Data Zone based estimates vs. Postcode based estimates and Chart 3b 
details Output Area based estimates vs. Postcode based estimates. 


7.8. The quartile absolute differences and quartile percentage differences of the 
estimates based on Data Zones or Output Areas against estimates based on 
Postcodes for SPCs are shown in Table 4 below. 
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Chart 3b) Frequency of Proportional Differences of SPC Populations (Compared to 


Postcode Populations) - Output Area 
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Table 4 - SPC Population Estimates 


Output Area vs. Data Zone vs. | Output Area vs. Data Zone vs. 
Postcode (Abs. Postcode (Abs. Postcode (% Postcode (% 


Q2 (Median) 


Q3 





Average population of a Scottish Parliamentary Constituency area = 69,340 
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8. Results - Multi-Member Wards: 


8.1.As a relatively small geography, with non-standard boundaries drawn up by 
the Local Authority Boundary Commission, matches to MMWs are likely to be 
quite difficult. 


8.2.When population estimates based on Output Areas are compared to 
population estimates based on Postcodes, 11 are the same, 174 MMWs 
have higher Output Area based population estimates and 168 have higher 
Postcode based population estimates. 


8.3.When comparing Output Area derived estimates to Postcode derived 
estimates, the largest positive difference comes to 371 extra people in the 
Lenzie and Kirkintilloch South MMW (+2.6%) and the largest negative 
difference comes to 304 fewer people in the Partick West MMW (-2.6%). On 
average, the differences between Output Area and Postcode derived 
populations are around 0.50 per cent per MMW. 


8.4.When population estimates based on Data Zones are compared to 
population estimates based on Postcodes, 9 are the same, 174 MMWs have 
higher Data Zone based population estimates and 170 have higher Postcode 
based population estimates. 


8.1.When comparing Data Zone derived estimates to Postcode derived 
estimates, the largest positive difference comes to 1,486 extra people in the 
Dunfermline North MMW (+12%) and the largest negative difference comes 
to 1,505 fewer people in the Rutherglen Central & North MMW (-11%). 


8.2.On average, the differences between Data Zone and Postcode derived 
populations are around 3.07 per cent per MMW. Assuming a cut-off of plus or 
minus 2.5 per cent for Data Zone vs. Postcode populations, 150 MMWSs fall 
out of the cut-off points. 


8.3.Charts 4a below details the frequency of proportional differences between 
Data Zone based estimates vs. Postcode based estimates and Chart 4b 
details Output Area based estimates vs. Postcode based estimates. 


8.4.The Min, 10th, 25th, 50th, 75th, 90th & Max percentile absolute differences 
and quartile percentage differences of the estimates based on Data Zones or 
Output Areas against estimates based on Postcodes for MMWs are shown in 
Table 5 below. 
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Table 5 - MMW Population Estimates 


Output Area vs. Data Zone  vs.| Output Area vs. Data Zone vs. 
Postcode (Abs. Postcode (Abs. Postcode (96 Postcode (96 


2596 (Q1) 


5096 
(Median) 


7596 (Q3) 
9096 


Max 





Average population of a Multi-member Ward area = 14,340 
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9. Summary & Conclusions: 


9.1. The average proportional difference of the estimates based on Data Zones or 
Output Areas against estimates based on Postcodes are shown in the table 
below. 


Table 6 - Summary Table 


Output Area based estimates Data Zone based estimates 


Non- Average No. of areas Av. difference | No. of areas Av. difference 
standard Population | with different to population | with different to population 
Geography populations to | estimated by | populationsto | estimated by 
Postcode postcodes Postcode postcodes 
based (90) based (96) 
estimates estimates 


National 
Parks (2) 15,510 21 2 (100%) f 2 [2 (100%) 


CHPs (40) 126,550 | 11/40 (27.5%) : 14 / 40 (35%) 


WPCs (59) 85,980 | 50/59 (84.7%) 51 / 59 (86.4%) 


SPCs (73) 69,340 | 67/73 (91.8%) 68 / 73 (93.2%) 


342 / 353 344 / 353 
MMWs (353) 14,340 (96.9%) (97.5%) 





9.2. Assuming that Postcode based estimates are the closest available estimates 
to the true population of non-standard geographies, then Output Areas (only 
available for the 2001 Census) provide estimates that are very close to 
Postcode based estimates for all geographies. 


9.3. Data Zones (available from 2001 — current) can be used to produce very 
close estimates for CHPs, WPCs and SPCs. They can also be used to 
produce close estimates for National Parks and MMWs (averaging 3% 
difference in population estimates). 
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